RAW SEQUENCE LISTING

PATENT APPLICATION US/09/103,287 



DATE: 07/06/98 
TIME: 14:24:28 



INPUT SET: S27178.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



General Information:



(i) APPLICANT: Wallis, Nicola G.

Burnham, Martin K. R. 






(ii) TITLE OF INVENTION: murC 



(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dechert, Price & Rhoads 

(B) STREET: 4000 Bell Atlantic Tower, 1717 Arch Stre 

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19103-2793 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/052,720 

(B) FILING DATE: 03-JUL-1997 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Falk, Stephen T 

(B) REGISTRATION NUMBER: 36,795 

(C) REFERENCE/DOCKET NUMBER: GM10025

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-994-2488 

(B) TELEFAX: 215-994-2222 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATGAGTAAGG AGTTTTATAT AATGACACAC TATCATTTTG TCGGAATTAA AGGTTCTGGC 60 

ATGAGTTCAT TAGCACAAAT CATGCATGAT TTAGGACATG AAGTTCAAGG ATCGGATATT 120 

GAGAACTACG TATTTACAGA AGTTGCTCTT AGAAATAAGG GGATAAAAAT ATTACCATTT 180

GGTGCTAATA ACATAAAAGA AGATATGGTA GTTATACAAG GTAATGCATT CGCGAGTAGC 240 

CATGAAGAAA TAGTACGTGC ACATCAATTG AAATTAGATG TTGTAAGTTA TAATGATTTT 300 

TTAGGACAGA TTATTGATCA ATATACTTCA GTAGCTGTAA CTGGTGCACA TGGTAAAACT 360 

TCTACAACAG GTTTATTATC ACATGTTATG AATGGTGATA AAAAGACTTC ATTTTTAATT 420 

GGTGATGGCA CAGGTATGGG ATTGCCTGAA AGTGATTATT TCGCTTTTGA GGCATGTGAA 480 

TATAGACGTC ACTTTTTAAG TTATAAACCT GATTACGCAA TTATGACAAA TATTGATTTC 540 

GATCATCCTG ATTATTTCAA AGATATTAAT GATGTTTTTG ATGCATTCCA AGAAATGGCA 600 

CATAATGTTA AAAAAGGTAT TATTGCTTGG GGTGATGATG AACATCTACG TAAAATTGAA 660 

GCAGATGTTC CAATTTATTA CTATGGATTT AAAGATTCGG ATGACATTTA TGCTCAAAAT 720 

ATTCAAATTA CGGATAAAGG TACTGCTTTT GATGTGTATG TGGATGGTGA GTTTTATGAT 780 

CACTTCCTGT CTCCACAATA TGGTGACCAT ACAGTTTTAA ATGCATTAGC TGTAATTGCG 840 

ATTAGTTATT TAGAGAAGCT AGATGTTACA AATATTAAAG AAGCATTAGA AACGTTTGGT 900 

GGTGTTAAAC GTCGTTTCAA TGAAACTACA ATTGCAAATC AAGTTATTGT AGATGATTAT 960 

GCACACCATC CAAGAGAAAT TAGTGCTACA ATTGACACAG CACGAAAGAA ATATCCACAT 1020 

AAAGAAGTTG TTGCAGTATT TCAACCACAC ACTTTCTCTA GAACACAAGC ATTTTTAAAT 1080 

GAATTTGCAG AAAGTTTATG TAAAGCAGAT CGTGTATTCT TATGTGAAAT TTTTGGCTCA 1140 

ATTAGAGAAA ATTCTGGCGC ATTAACGATA CAAGATTTAA TTGATAAAAT TGGAGGTGCA 1200

TCGTTCATTA ATGAAGATCT TATTAATGTA TTAGAACAAT TTGATAATGC TGTTGTTTTA 1260 

TTTATGGGTG CAGGTGATAT TCAAAAATTA CAAAATGCAT ATTTAGATAA ATTAGGCATG 1320

AAAAATGCGT TTTAATATGT TTATAATAGA G 1351 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr His Tyr His Phe Val Gly Ile Lys Gly Ser Gly Met Ser Ser

1 5 10 15

Leu Ala Gln Ile Met His Asp Leu Gly His Glu Val Gln Gly Ser Asp

20 25 30

Ile Glu Asn Tyr Val Phe Thr Glu Val Ala Leu Arg Asn Lys Gly Ile




35 40 45

Lys Ile Leu Pro Phe Gly Ala Asn Asn Ile Lys Glu Asp Met Val Val

50 55 60

Ile Gln Gly Asn Ala Phe Ala Ser Ser His Glu Glu Ile Val Arg Ala

65 70 75 80

His Gln Leu Lys Leu Asp Val Val Ser Tyr Asn Asp Phe Leu Gly Gln

85 90 95

Ile Ile Asp Gln Tyr Thr Ser Val Ala Val Thr Gly Ala His Gly Lys

100 105 110

Thr Ser Thr Thr Gly Leu Leu Ser His Val Met Asn Gly Asp Lys Lys

115 120 125

Thr Ser Phe Leu Ile Gly Asp Gly Thr Gly Met Gly Leu Pro Glu Ser

130 135 140

Asp Tyr Phe Ala Phe Glu Ala Cys Glu Tyr Arg Arg His Phe Leu Ser

145 150 155 160

Tyr Lys Pro Asp Tyr Ala Ile Met Thr Asn Ile Asp Phe Asp His Pro

165 170 175

Asp Tyr Phe Lys Asp Ile Asn Asp Val Phe Asp Ala Phe Gln Glu Met

180 185 190

Ala His Asn Val Lys Lys Gly Ile Ile Ala Trp Gly Asp Asp Glu His

195 200 205

Leu Arg Lys Ile Glu Ala Asp Val Pro Ile Tyr Tyr Tyr Gly Phe Lys

210 215 220

Asp Ser Asp Asp Ile Tyr Ala Gln Asn Ile Gln Ile Thr Asp Lys Gly

225 230 235 240

Thr Ala Phe Asp Val Tyr Val Asp Gly Glu Phe Tyr Asp His Phe Leu

245 250 255

Ser Pro Gln Tyr Gly Asp His Thr Val Leu Asn Ala Leu Ala Val Ile

260 265 270

Ala Ile Ser Tyr Leu Glu Lys Leu Asp Val Thr Asn Ile Lys Glu Ala

275 280 285

Leu Glu Thr Phe Gly Gly Val Lys Arg Arg Phe Asn Glu Thr Thr Ile

290 295 300

Ala Asn Gln Val Ile Val Asp Asp Tyr Ala His His Pro Arg Glu Ile

305 310 315 320

Ser Ala Thr Ile Asp Thr Ala Arg Lys Lys Tyr Pro His Lys Glu Val

325 330 335

Val Ala Val Phe Gln Pro His Thr Phe Ser Arg Thr Gln Ala Phe Leu

340 345 350

Asn Glu Phe Ala Glu Ser Leu Cys Lys Ala Asp Arg Val Phe Leu Cys

355 360 365

Glu Ile Phe Gly Ser Ile Arg Glu Asn Ser Gly Ala Leu Thr Ile Gln

370 375 380

Asp Leu Ile Asp Lys Ile Gly Gly Ala Ser Phe Ile Asn Glu Asp Leu

385 390 395 400

Ile Asn Val Leu Glu Gln Phe Asp Asn Ala Val Val Leu Phe Met Gly

405 410 415

Ala Gly Asp Ile Gln Lys Leu Gln Asn Ala Tyr Leu Asp Lys Leu Gly

420 425 430

Met Lys Asn Ala Phe

435

(2) INFORMATION FOR SEQ ID NO: 3:




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 660 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ATTTAAAGAT TCGGATGACA TTTATGCTCA AATATTTCAA ATTACGGATA AAGGTACTGC 60

TGTTGATGTG TATGTGGATG GTGAGTTTTA TGATCACTTC CTGTCTCCAC AATATGGTGA 120

CCATACAGTT TTAAATGCAT TAGCTGTAAT TGCGATTAGT TATTTAGAGA AGCTAGATGT 180

TACAAATATT AAAGAAGCAT TAGAAACGTT TGGTGGTGTT AAACGTCGTT TCAATGAAAC 240

TACAATTGCA AATCAAGTTA TTGTAGATGA TTATGCACAC CATCCAAGAG AAATTAGTGC 300

TACAATTGAC ACAGCACGAA AGAAATATCC ACATAAAGAA GTTGTTGCAG TATTTCAACC 360

ACACACTTTC TCTAGAACAC AAGCATTTTT AAATGAATTT GCAGAAAGTT TAAGTAAAGC 420

AGATCGTGTA TTCTTATGTG AAATTTTTGG ATCAATTAGA GAAAATACTG GCGCATTAAC 480

GATACAAGAT TTAATTGATA AAATTGAAGG TGCATCGTTA ATTAATGAAG ATTCTATTAA 540

TGTATTAGAA CAATTTGATA ATGCTGTTGT TTTATTTATG GGTGCAGGTG ATATTCAAAA 600

ATTACAAAAT GCATATTTAG ATAAATTAGG CATGAAAAAT GCGTTTTAAT ATGTTTATAA 660

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 215 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Phe Lys Asp Ser Asp Asp Ile Tyr Ala Gln Ile Phe Gln Ile Thr Asp

1 5 10 15

Lys Gly Thr Ala Val Asp Val Tyr Val Asp Gly Glu Phe Tyr Asp His

20 25 30

Phe Leu Ser Pro Gln Tyr Gly Asp His Thr Val Leu Asn Ala Leu Ala

35 40 45

Val Ile Ala Ile Ser Tyr Leu Glu Lys Leu Asp Val Thr Asn Ile Lys

50 55 60

Glu Ala Leu Glu Thr Phe Gly Gly Val Lys Arg Arg Phe Asn Glu Thr

65 70 75 80

Thr Ile Ala Asn Gln Val Ile Val Asp Asp Tyr Ala His His Pro Arg

85 90 95

Glu Ile Ser Ala Thr Ile Asp Thr Ala Arg Lys Lys Tyr Pro His Lys

100 105 110

Glu Val Val Ala Val Phe Gln Pro His Thr Phe Ser Arg Thr Gln Ala

115 120 125

Phe Leu Asn Glu Phe Ala Glu Ser Leu Ser Lys Ala Asp Arg Val Phe

130 135 140

Leu Cys Glu Ile Phe Gly Ser Ile Arg Glu Asn Thr Gly Ala Leu Thr

145 150 155 160




Ile Gln Asp Leu Ile Asp Lys Ile Glu Gly Ala Ser Leu Ile Asn Glu

165 170 175

Asp Ser Ile Asn Val Leu Glu Gln Phe Asp Asn Ala Val Val Leu Phe

180 185 190

Met Gly Ala Gly Asp Ile Gln Lys Leu Gln Asn Ala Tyr Leu Asp Lys

195 200 205

Leu Gly Met Lys Asn Ala Phe

210 215

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CTTCATTAAT GAACGATGC 19

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GTTACAAATA TTAAAGAAG 19



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/103,287 



DATE: 07/06/98 
TIME: 14:24:32 



INPUT SET: S27178.raw



Original Text 



